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Abstract 



Formal calculi of record structures have recently been a focus of active research. However, 
scarcely anyone has studied formally the dual notion — i.e., argument-passing to functions 
by keywords, and its harmonization with currying. We have. Recently, we introduced 
the label-selective A-calculus, a conservative extension of A-calculus that uses a labeling of 
abstractions and applications to perform unordered currying. In other words, it enables some 
form of commutation between arguments. This improves program legibility, thanks to the 
presence of labels, and efficiency, thanks to argument commuting. In this paper, we propose 
a simply typed version of the calculus, then extend it to one with ML-like polymorphic types. 
For the latter calculus, we establish the existence of principal types and we give an algorithm 
to compute them. Thanks to the fact that label- selective A-calculus is a conservative extension 
of A-calculus by adding numeric labels to stand for argument positions, its polymorphic typing 
provides us with a keyword argument-passing extension of ML obviating the need of records. 
In this context, conventional ML syntax can be seen as a restriction of the more general 
keyword-oriented syntax limited to using only implicit positions instead of keywords. 



Resume 



Recemment, les calculs formels de structures d'enregistrement ont fait l'objet de recherche 
active. Nonobstant, presque personne n' a etudie formellement la notion duale — en l'occurrence, 
le passage d' arguments par mot-clefs, et son harmonisation avec la curryfication. Nous nous y 
sommes interesses. Nous avons introduit recemment le A-calcul label-selectif, une extension 
conservatrice du A-calcul qui utilise un etiquetage des abstractions et des applications pour 
permettre la curryfication dans le desordre. En d'autres termes, il autorise une forme de 
commutation entre les arguments. Cela ameliore la lisibilite, grace a la presence des etiquettes, 
et l'efficacite, grace a la commutation des arguments. Dans cet article, nous proposons une 
version simplement typee du calcul, puis nous l'etendons avec des types polymorphes a la 
ML. Pour ces derniers, nous etablissons 1' existence de types principaux et nous donnons 
un algorithme qui les calcule. Grace au fait que le A-calcul label-selectif est une extension 
conservatrice du A-calcul obtenue en y ajoutant des etiquettes numeriques correspondant aux 
positions d'arguments, son typage polymorphe nous fournit une extension de ML avec passage 
d'arguments par mot-clefs sans necessiter de structures d'enregistrement. Vue ainsi, la syntaxe 
conventionnelle de ML peut etre percue comme une restriction d'une syntaxe plus generale 
orientee mot-clefs qui serait limitee a seulement utiliser des positions implicites au lieu de 
mot-clefs. 
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They were both very pleased with this new view of the 
matter, which did credit to them both, and we all parted 
on the most friendly terms. 

Robert Graves, /, Claudius 

1 Introduction 

The use of symbolic labels in programming languages is not new. This has been done 
in two ways. The first one, common to nearly all languages, is as field designators in 
record structures. Relatively recently, formalisms for records have been proposed. This 
started with Cardelli [6], was later extended to a second order calculus [7], and was followed 
by a number of record-type inference systems compatible with ML-style polymorphic type 
inference [22, 20, 13, 19]. Even more recently, a compilation method was proposed by 
Ohori [18], for an extension of A-calculus containing poiymorphically typed records. 

Another way to use labels in programming languages has been as keywords for parameter- 
passing in procedure or function calls. This is the case in Common LISP [21], ADA [15], 
and LIFE [4]. However, in Common LISP or ADA, currying is not supported, which makes 
the situation rather mild. Although currying is supported in LIFE, even with keywords given 
in a different order, it is restricted nonetheless and does not accommodate implicit positions 
as it should. Indeed, fully flexible currying with the presence of keywords as well as explicit 
and implicit positions was until recently a still unexplored issue. Some proposals do offer this 
convenience of parameter-passing without modifying the core calculus [14, 17]. However, 
these are based on using a notion of store; that is, bindings from names to values. This 
introduces another parameterizing system, independent from A-calculus. Even so, to our 
knowledge, no typing system has been proposed for them. 

Our own proposal, as originally reported in [2], is to support this new convenience 
of labeling arguments directly in A-calculus and accommodate selective unordered currying 
through commutation of arguments. In our view, the role of arguments is determined by their 
labels, which interact with their order. 

Selective A-calculus introduces two types of commutations. The first, and most immediate, 
is between symbolic labels. By analogy with tuples, when currying an expression f{p =>- 
a,q^b, . . .) we obtain an expression ((f(p^a))(q^b))(. . .). But since there is no reason 
to apply / in this specific order, using the freedom provided by labels allows to curry in a 
different order; e.g., ((/(g=> a))(...). Suppressing superfluous parentheses, and 

limiting our consideration to two arguments, we obtain that the following equality must hold 
in our calculus: 

f(p=>a)(q=>b) =f(q=>b)(p=>a). 

However, this is true under a restriction: p and q must be distinct labels. Successive 
applications on the same label must not commute. Indeed, if the labels are equal, the order of 
these applications must be obeyed to be unambiguous. 

Here is an example of the use of these symbolic labels for the list constructor, in an 
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ML-like language, together with inferred types. 

#let cons car=>a cdr=>b = a::b;; 
cons : { car=>' a, cdr=>' a list} -> 'a list 

Icons cdr=> [ 1 ] ; ; 
it : {car=>int} -> int list 

The second commutation equality comes from a reversion of the analogy with tuples. 
That is, we can see a tuple as a record labeled with numbers: (a, b, ...) = (l =>a, 2=>fe, .. .). 
If we applied the equality used for symbolic labels, we would obtain /(l =>- a) (2 =>- b) = 
f{l^-b){\ =>a). But, since it is better to see unary application as implicitly using the label 1 
and keep conventional currying, we would rather write /(l =>- a)(l =>- b), or simply, / a b as 
usual. To make this possible, we must define commutation differently on numbers: namely, 
/(2=>6)(l=>a) =/(l=> fl )(l=>fe). This can be generalized as: 

f(m^a)(n^b) = f(n^-b)(m - l=^a) if m > n. 

For instance we can use it as follows, (omitting explicitly labeling with 1=>): 

#let sub x y = x-y; ; 
sub : { l=>int, 2=>int } -> int 

#let minusl5 = sub 2=>15; ; 
minusl5 : {l=>int} -> int 

This second commutation equality is in fact orthogonal to the first one. Commutation on 
symbolic labels expresses the intuitive possibility of taking input on multiple channels, while 
the numeric form gives a control on the relative precedence order of input on a given channel. 

Selective A-calculus provides the above equalities for symbolic and numerical labels 
for both application and abstraction. As an untyped calculus, its confluence has been 
established [3], along with fundamental properties of A-calculus like Bdhm's theorem [12]. 

Similarly, the introduction of label-selective types providing simple types for selective 
A-terms is done in the same manner as that of simple types in classical A-calculus. The 
essential difference is that, in order to emphasize the intrinsic commutativity, we will put 
on the same level all argument types to a function. For instance, the cons^ operator, 
namely cons int {car h : int, cdr =4> t : int list) = (h :: i) for integer lists, should get type 
{car =>- int, cdr ^ int list} — > int list. Such a notation shows that it is possible to apply consi nt 
on both car and cdr labels, and that the result is a list of integers. 

Then we build a polymorphic typing system a la ML for selective A-calculus. As 
for ML-style polymorphism, a type inference algorithm exists, which obviates the need for 
explicit typing. In other words, this means that we can integrate labeled parameters in any 
ML-like programming language. Continuing with the previous example, for the definition 
cons(car^h,cdr^t) = (h :: i), we can infer the type \/a.({car^a,cdr^a list} — > a list). 

1- We use a notation close to CAML [10]: "let" denotes a definition, " : : " the list constructor. Since "=>" is 
left unused (abstraction uses "->"), we use it for labeling. 
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Such a type system is particularly well-adapted to selective A-calculus, thanks to the 
incrementality of typing, which goes together with application. On the other hand a second 
order type system, separating type application, would limit commutation possibilities by 
introducing new dependencies between abstractions. 

Section 2 gives a practical and theoretical motivation for our type system. We then 
define symbolic and numerical label-selective A-calculus in Section 3 and 4, combining them 
in a product system in Section 5. Sections 6 and 7 present respectively simple typing and 
polymorphic typing of the selective A-calculus. To avoid cluttering the casual reader's attention 
with unnecessary details, we have relegated all proofs to the appendix. 

2 Motivation 

The calculus we present has practical and theoretical motivations. In practice, the use 
of labels for argument selection enhances clarity and obviates the need of argument-shuffling 
combinators. From a theoretical perspective, the commutation laws of labeled arguments 
readily render natural type isomorphisms in A-calculus. 

2.1 Keywords: an enhancement for clarity 

We start here by giving some examples of how the use of keywords, and their appearance 
in types, may help the programmer. Our view is already partially proven by the ubiquitous 
use of records as data structures. While theoretically everything could be done with tuples, 
one will often prefer using a record, gaining abstraction over a representation using explicitly 
ordered formats. 

Here are some examples of functions written in an ML-like syntax, with their inferred 
types. 

#let rec map function=>f = fun 

# []">[] 

# | [h|t] -> (f h) : :map function=>f t; ; 
map : {l=>'a list, 

function=>{ 1=>' a} -> 'b} -> 'b list 

#map function=> (add 1) ; ; 
it : {l=>int list} -> int list 

#map [ 1 ; 2 ; 3 ] ; ; 
{ function=>{ l=>int } -> 'a} -> 'a list 

The advantage of this labeling system is twofold: it is more expressive and it allows doing 
partial application selectively on any label. 

One could argue that in the functions above, order is clear enough so that, even without 
labels, there is no possibility for error. However this becomes less systematic for functions of 
three arguments or more. Moreover, it is not so natural in some two-argument functions. This 
is the case, for instance, of mem (membership in a list) or a s soc (retrieval from an association 
list), whose respective types are: 

mem : ' a -> ' a list -> bool 
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assoc : 'a -> ('a * 'b) list -> 'b 

There is no special reason for them to respect this particular order. In fact, the opposite order 
of arguments would appear more natural, since currying with a given list is more likely. Here, 
a quick glance at the type eliminates any ambiguity. However, this is not always sufficient. 
Even if such was the case, the following types would certainly be more perspicuous: 

mem : { 1=>' a, in=>' a list} -> bool 

assoc : { 1=>' a, in=> ( ' a * 'b) list} -> 'b 

With this, one can define such a function as: 

#let digit = mem in=> [ 0 ; 1 ; 2 ; 3; 4 ; 5; 6; 7 ; 8 ; 9 ] ; ; 
digit : {l=>int} -> bool 

This clearly improves legibility. 

Still, one may shrug this argument off since with two arguments, there are only two 
possibilities of order. With more arguments, however, this quickly becomes irksome. Clearly, 
remembering arguments order for functions of more than three arguments — and those are not 
so uncommon — is out of the question. 

Let us give some more examples. Consider, for instance, it_list and list_it (fold 
left and right), with types: 

it_list : ('a -> 'b -> 'a) -> 'a -> 'b list -> 'a 
list_it : ('a -> 'b -> 'b) -> 'a list -> 'b -> 'b 

An explicit labeling such as: 

it_list : {!=>'& list,op=>{l=>'b,2=>'a} -> 'b,zero=>'b} -> 'b 
list_it : {l=>'a list, op=> { 1=>' a, 2=>' b } -> 'b,zero=>'b} -> 'b 

would be more expressive, making the types easier to understand. 

We have deliberately restricted our examples to generic functions, for which currying is 
useful. If we consider functions interfacing a window manager, for example, the number of 
arguments per function is such that the use of labels is a necessity. In that case, however, one 
could do with records, since currying is not so important. Nevertheless, the trend in functional 
languages is towards a systematic use of currying. Standard ML is a notable exception, 
preferring uncurried functions, but CAML is an example of an ML dialect preferring currying. 

2.2 Relative positions versus combinators 

If the main benefit from using symbolic labels is expressiveness, that of relative positions 
is in conciseness — and efficiency. 
Consider, for example: 

#let cons a b = a::b;; 
cons : {l=>'a,2=>'a list} -> 'a list 

#map f unction=> ( cons 2=> [ 1; 2 ] ) ; ; 
it : {l=>int list} -> int list list 

#map function=> (sub 2=>10) [11; 12; 13];; 
it = [1;2;3] : int list 
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Of course, the same effect can be obtained using the C combinator defined as: 

#let C f x y = f y x; ; 
C : ('a -> 'b -> 'c) -> 'b -> 'a -> 'c 

#map (C sub 10) [11; 12; 13];; 
it = [1;2;3] : int list 

But, besides legibility, the hidden loss is efficiency: a combinator is an explicit closure to 
build and reduce, whereas label commutation enables direct access into the argument stack 
with offsets. Moreover, for more than two arguments, currying on the Mi argument would 
necessitate k — 1 such swaps, or use a special combinator for each position — just as expensive. 

In addition to this obviously practical benefit, relative position labels provide a coherent 
bridge connecting classical currying and record currying. 

2.3 A generic commutation capability 

With respect to types, we can see these extensions as the integration into A-calculus of the 
natural isomorphism: 

A x B ~ B x A, 
which, combined with currying, 

Ax B -» C ~ A -» (B -» C), 
gives: 

A -» (B -» C) ~ B -» (A -» C). 

This isomorphism becomes clearer when using indexed products, as in category theory, 
with explicit projections tti and -ki\ 

(7Ti=^A) x (tt 2 ^B) ~ (tt 2 ^B) x (tti^A), 

and thus: 

A) -> ((*2=>fl) -> C) ~ (t^S) -> ((ttj^A) -> C). 

Therefore, we obtain a type system in which these isomorphisms, which are part of those 
described in [5], are directly included. 

If we want to keep a confluent calculus, however, it is necessary to sacrifice either 
generality (two identical keywords may not commute) or referential stability of positions (new 
projections after commutation). For this reason positions are necessary to allow commuting in 
any case. They ensure that association between an abstraction and an application is invariant 
even if their respective positions change. This is important operationally as they allow direct 
access to distant arguments (i.e., deep in the stack). While symbolic labels are a useful 
extension of currying, numerical ones are similar to de Bruijn indices [9]. 



Research Report No. 35 



October 1 993 



6 



Jacques Garrigue and Hassan ATt-Kaci 



3 A-Calculus with multiple channels 

To obtain the behavior that we illustrated with keywords, we define an extension of the 
A-calculus, the symbolic selective X-calculus, with symbolic labels. 

Selective A-terms consist of variables, taken from a set V, and two labeled constructions: 
abstraction and application. We shall assume a non-empty, totally ordered, set of symbols S, 
to use as labels. We will denote variables by x,y, labels by p, q, and A-expressions by capital 
letters. The syntax of selective A-terms is then given as: 

M ::= x (variables) 



We will say "to abstract x on p in M", "to apply M to M' through p". These terms will always 
be considered modulo a-conversion. 

To make this compatible with the classical A-calculus, we shall distinguish a special label, 
written i, to use as default. 2 That is, any unlabeled abstraction or application is interpreted as 
being labeled by i. In other words, classical A-calculus is the special case when S = {*.}. 

The reduction rules for this calculus are given in Figure 1. /3-Reduction only happens 



Reduction: 

09) {\ p x.M)~N^[N/x]M 
Reordering: 

(1) \ p x.\ q y.M — > \ q y.\ p x.M p > q 

(2) M-NqP — M~P~N p>q 

(3) {\ p x.M)~N — \ p x.{M~N) p^q, xg FV{N) 



Figure 1 . Reduction rules for symbolic selective A-calculus 

on abstraction-application pairs with the same label. 3 Otherwise they commute by rule (3). 
Rules (1) and (2) simply normalize the order of abstractions and applications. 

For convenience, we will sometimes use a variant syntax using record notation. A record 
is an expression of the form => Afi, . . . ,p„^M n ) the p,-s are labels and the M,s are terms. 
We shall use these expressions with the following syntactic equivalence: 



2 It will be convenient, though not necessary, to assume that i is the least element of S. 

3 The notation [N/x]M denotes the term obtained from M after substituting all the free occurrences of variable x 
with the term N. 



X p x.M 
MpM' 



(abstractions) 



(applications). 
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X(pi^x u ...,p n ^x n ).M = \ pi x\. - ■ ■ .\ Pn x n .M 
M(p 1 =>M 1 ,...,p n =>M n ) = (...(Mf 1 M 1 )...f„M„). 

An example of reduction in the symbolic selective calculus is given in Figure 2 



We suppose that p < q < r < s: 

(X{p^x,q^y, r=>z)M){r=>N, s=>P,p=>Q) 

= {X p x.X q y.X r z.M)7N7P7Q 

^3 (X p x.((X q y.X r y.M)7N))7P7Q 

-> 2 (X p x.((X q y.X r y.M)7N))pQ7P 

-V, {X q y.X r z.[Q/x\M)7N7P 

^3 (A ? y.((A f z.[e/x]M)?^V))?P 

->/? {\y-{[Q/x][N/z]M))7P 

^3 A 9 y.(([e/x][A?/z]M)?P) 



Figure 2. Example of reduction with symbolic labels 

We call symbolic selective X-calculus the free combination of these rules and a-conversion. 
Theorem 1 The symbolic selective X-calculus is confluent. 

Proof: Consequence of the proof for selective A-calculus, in [2] | 

4 A-Calculus with relative positions 

This calculus is very similar to the previous one. Its syntax is identical; the only difference 
is that the labels are positive natural numbers: 

M'-' = x | \ n .M | M7M' where rGJVcIV- {0}. 

Again, for compatibility with the classical A-calculus, we shall use position 1 as default. 
That is, any unlabeled abstraction or application is interpreted as being labeled by 1. In other 
words, classical A-calculus is the special case when J\f = {1}. 

The reduction rules are also similar, but with a twist. They are are given in Figure 3. The 
main idea here is to preserve coherence between argument position numbers and the property 
used for currying that all functions are unary. Hence, it is necessary to adjust a position number 
relatively to the form on its left. 

Similarly to what we do for symbolic labels, we will also use a number-labeled record- 
syntax variant of the raw syntax for convenience. However, unlike the freely commuting 
symbolic labels, the numbers used as labels in record notation do not correspond directly to the 
relative position labels of the raw syntax. Namely, translating from the record syntax to raw 
syntax must readjust an argument's position index by subtracting an offset equal to the number 
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Reduction: 

{/3) {\„x.M)~N^[N/x]M 
Reordering: 

(4) \ m x.\„y.M — > \ n y.\ m _\x.M m> n 

(5) M^NnP^M-P^iN m>n 

(6) {\ m x.M)~N^ \ m - lX .{M~N) m>n,x(£FV{N) 

(7) {\ m x.M)~N^ A m x(M„-TiiV) m<n,x(£FV{N) 



Figure 3. Reduction rules for numerical selective A-calculus 

of arguments of lesser position indices on its left. More precisely, let (n\ =>- M\ , . . . , n^ =>- M^) 
be a record expression where n ; - 6 M for i = 1 , . . . , k. Then, for any i = I,..., kin this 
expression, its relative position offseth the number o(i) of labels in the set {rci, . . . , that 
are strictly less than n { . For example, the relative position offsets of the record expression: 

(4=>M 1 ,1=>M 2 ,5=>M 3 ,2=>M4,2=>M 5 ) 

are: o(l) = 0,o(2) = 0,o(3) = 2,o(4) = l,o(5) = 1. 
Hence, the syntactic equivalence is given by: 

A(ni=^xi,...,%=^).M = A ni xi A njt _ 0 ( fe )Xi.M 

M( ni =>M u ...,n k =>M k ) = (...(M^Mi) ... nk r^ {k) M k ). 
An example of reduction in the numerical selective calculus is given in Figure 4 
Theorem 2 The numerical selective X-calculus is confluent. 

Proof: Consequence of the proof for selective A-calculus. | 

5 The selective A-calculus 

The selective A-calculus combines orthogonally the symbolic and the numerical selective 
A-calculi by using C = S X A/" as set of labels. 4 Thus its syntax is: 

M:: = x | \ t .M \ M^M' where l=pn£C = SxN. 

The reduction system is the combination in Figure 5. Applying these rules simply amounts 
to applying independently the symbolic and numeric systems. One may see reordering rules 
as structural equalities, and /3-reduction as unique reduction rule. Since the combination is 
orthogonal, it inherits confluence from both systems. 

4 In [2] this particular variant was defined as a product system, and what we called there selective A-calculus as 
the sum system C = S U Af. Properties of the two systems being similar, we work here on the most general one. 
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(A(2=^x, l=^,4=^z).M)(4=^/V,6=^P,2=^g) 

= {\ 2 X.\ iy .\2Z.M)^N^P2Q 

-» 4 {\ 1 y.\ 1 x.\2Z.M)^N7PiQ 
-» 7 (A 1 y.((A 1 x.A 2 z.M)^))5P2«3 
-> 5 {My.{{Mx.\2zM)3N))2Q^P 

-> 7 {\ iy .\ lX .{{\ lZ .M)^N))2Q4P 

(\iy.\ix.[N/z\M)?Q?P 
-> 7 (My.{(Mx.[N/z]M)tQ))?P 

(My.[Q/xW/z]M)?P 
-7 My-W/xW/WsP) 



Figure 4. Example of reduction with numeric labels 



Reduction: 

{/3) (X t x.M) ?N -» [7V/x]M 



Symbolic reordering: 

(1) A^x.A^y.M -> \ n y.\ m x.M p>q 

(2) Mf m Nq n P^M~Pf m N p>q 

(3) ( A pm x.M) f„ /V - \ pm x. (M ~ N) P ^q,x$ FV(N) 

Numeric reordering: 



(4) \ pm x.\ pn y.M^ \ pn y.\ pm _ix.M m> n 

(5) M~ m Np n P^M~ n P^r x N m>n 

(6) (\ pm x.M)~N -» \pm-\x\M~N) m>n,x<£ FV{N) 

(7) ( A^x.M) ~ AT -» A^x. (Mp, /V) m<n,x<£ FV{N) 



Figure 5. Reduction rules for selective A-calculus 
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Theorem 3 The selective X-calculus is confluent. 



Proof: This is a consequence of the proof for the sum system in [2]. We extend easily the use of 
numerical indices, which can be seen as being limited to a only one keyword in the sum system, to 
all keywords thanks to channel independence. | 



To let this system include the symbolic calculus and numerical calculus as sub-calculi, we 
will identify a symbolic keyword p in the former with the label (p, l), and a numeric index n 
in the latter with the label (i, n). Thus, the classical unlabeled A-calculus is also syntactically 
embedded in selective A-calculus by taking (i, l) as the default label of all abstractions and 
applications. 



6 Simple types 

As in classical A-calculus, we introduce simple types. There are two benefits. First, 
we gain a better understanding of the label-selective calculus itself by explicating the type 
structure that it needs. Second, simply typed selective A-calculus gains the same nice expected 
properties; e.g., strong normalization of well-typed terms. 

6.1 Syntax and types 

The original syntax of terms is extended to: 

M:: = x\ \ix:t.M \ MfM'. 

which requires abstracted variables to be explicitly typed. 

We define the syntax of label-selective simple types with the following grammar: 

I ::= pn (labels) 

u ::= u\\u2\... (base types) 

r ::= {l=>t,...} (record types) 

t : : = u | r — > u (general types) 

where the expression {l=>t,...} denotes a finite partial function from C to types, including 
the empty function {}. We shall identify a functional type of the form {} — > u with the base 
type u. Note that record types are not types of expressions of our term language. They are 
used exclusively as the left subexpression of function types. 

The idea behind this syntax of types is to convey that an application can be done 
indifferently through any label that is present in the type, on a value of corresponding type. 

6.2 Record concatenation 

We shall provide a simple-type inference system as expected. In order to do so, we 
must define a record-type concatenation operation needed for extending the domain type of a 
functional type. Before we give it formally, it is preferable to build some preliminary intuition. 
We will illustrate the essential mechanism on an example. 
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To simplify the discussion, let us first restrict ourselves to numeric labels only. Consider 
the two record types r = {2 =>- t\ , 4 =>- t{\ and s = {2 =>- u\ , 3 =>- ui}- Extending the type r 
on the right with s must be done such that the relative positions be kept in coherence. Now, 
r expects t\ in second position and ti in fourth position. In other words, positions 1, 3, 5 and 
up, are "free" in r in the sense that if more arguments were to be expected by an extension 
of r, they could use these free slots in sequence. Consider now extending r with s. The first 
argument's position in s is 2. Hence, in r's context, this argument corresponds to the second 
"free" slot; i.e., position 3. The following one in s is in position 3, and hence corresponds to 
the third "free" slot in r; i.e., position 5. Thus, the record type resulting from the concatenation 
of r and s is r • s = {2=>- t\, 3 =>- u\, 4=>- t2, 5 =>- 112}. 

The case of multiple channels is not more complicated since the above scheme is to be 
used on each channel independently. Intuitively, this operation reminds of stream merging. In 
fact, this is exactly what is happening as the indices on a given channel in a record indicate the 
expected positions, but only relative to this specific record. Extending the record with more 
indices on this channel necessitates adjusting the new indices by taking their positions with 
respect to the sequence of indices unused by the initial record. We now proceed to defining 
formally this record-type concatenation operation. 

Let r = {£i =>- t\,... ,l n =>• t n } be a record type. We shall denote by V r = {l\ , . . . , £„} the 
set of labels defined in r. Recall that our record labels are not simple symbols, but pairs of the 
form pn, a symbol and a position index. 

Definition 1 (occupied position) The n th position on p in a record type r is said to be 
occupied ifr is such that pn £ V r . 

Given a record type r, we denote by o r (pn) the offset of n on p in r to be the number 
of occupied positions on symbol p in r with index less than or equal to n. That is, 
o r (pn) = \{{p} X [l,n]) HV r \. 

For example, consider the two following record types: 

r = {p2^ti,p4^t 2 ,ql^t 3 ,q2^t4,q5^t 5 }, 
s = {p2^ui,p3^U2,q2^ii3,q3^U4}. 

The offsets of the labels of s in r are, respectively, o r (p2) = 1, o r (p3) = 1, o r (q2) = 2, and 
o r (^3) = 2. 

Given a record type r and a given symbol p, we need to identify the least index of p in r 
that is not an occupied position. More precisely, it is useful to know the nth such free position 
for symbol p in r. 

Definition 2 (free position) The n th free position/or p in r is given by: 
(/> r ,p(n) = min{/ 6 J\f \ i - o r (pi) = n}. 

For example, given the previous example's two record types, the free positions in r 
available for the indices in s are, respectively, (j> r , P {2) = 3, (/> r , P (3) = 5, (/> r , q (2) = 4, and 
M3) = 6. 
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For fixed r, this function is extended to work also on a record type by distributing it on 
each label. Namely, for any s = {p ; n ; =^}f =1 , <j> r {s) = ^ ^iLi- 

For example, for r and 5 used above: (/) r (s) = {p3^ui,p5^U2,q4^U3,q6^U4}. It is 
not coincidental that the label domain of the c/) r (s) is disjoint from that of r. It is easy to show 
that this is true in general. 

Definition 3 (concatenation) Record-type concatenation is defined as r-s = rb)(/) r (s), where 
I+J denotes union of functions with disjoint domains. 

Going back to the two record types r and s used in the examples above, we have 
r-s = {p2^ti,p3^ui,p4^t2,p5^U2,ql^t3,q2^t4,q4^U3,q5^t5,q6^U4}. 

Proposition 4 Label-selective record-types form a monoid; i.e., concatenation is associative 
with neutral element {}. 

Proof: This is because <fr r . s = <fr r o <fr s (see appendix). | 

6.3 Record matching 

It is essential for a syntax-directed inference system, like the typing system that we are 
about to give, to be able to solve syntactic equations of the form r ■ x = s. More specifically, 
to extract a subexpression r out of a record type expression it is convenient to write the latter 
as r l+J s {i.e., splitting it), and let that be the result of an expression r • x, solving for x. 

Remarkably, there is an inverse to record-type concatenation that allows solving such an 
equation and thus may be used to identify a given record type as the result of the concatenation 
of two other record types. We call this operation record-type matching. 5 It will be used with 
great benefit in typing rules as well as for polymorphic type unification and type inference as 
shown in the next section. 

Let r and s be two record types with disjoint label domains (i.e., such as could be obtained 
by partitioning one into two). Let pi be a label in s. For p, the position i can be seen as the 
result of having concatenated r with the same type originally at position i — o r [pi). In fact, for 
all the label indices i of p in s, this defines an inverse function for (j> riP as (j>~p (?) = i — o r (pi). 
That is, (/)^p(i) computes the index corresponding to i on channel p skipping the occupied 
positions on p in r that are less than or equal to i. 

As before, for fixed r, is extended to record types. Namely, for any s = {/?,-«,■ =>■ U}* =1 
such that pm for all i= 1, . . . , k, ^{s) = {p/^.(n,-) =>f/}f =1 . 

Definition 4 (matching) Record-type matching is defined as r 1+) s = r • (j>r l {s), where l+J 
denotes union of functions with disjoint domains. 

Let r = {pi =4> t\, ql =>- ti\ and s = {p2 =4> u\, q3 =>- ui}. The unique solution to the 
matching equation r 1+1 s = r • x is x = (j>~ 1 (s) = {pi =^ u\, q2^ 1*2}- 

5 Although division should be more appropriate. 



October 1 993 



Digital PRL 



The Typed Polymorphic Label-Selective A-Calculus 



13 



6.4 Typing rules 

We now have all we need to define well-typedness. We will denote by J" a typing 
environment; i.e., a mapping from term variables to types. The notation T[x i-> r] denotes the 
typing environment that coincides with r everywhere, except on x for which it gives the type 
r. 



Definition 5 A term M is well-typed if there is a mapping r from the free variables of M to 
types and a type r such that T h M : r is derivable in the type inference system of Figure 6. 



r[x \— > t] h x : t 

r[x i-^> 6] \~ M : r — > t 
r h \tx:6.M: {1^6} ■ r^r 



r h M : {1^6} ■ r -> r r \- N : 6 



(I) 



(II) 



(HI) 



Figure 6. Typing of simply-typed label-selective calculus 

Simply typed selective A-calculus verifies the two fundamental properties of typed 
A-calculi. 

Proposition 5 (subject reduction) Reduction preserves the types; i.e., if T V M : t and 
M -> N then r \- N : t. 

Theorem 6 (Strong normalization) The simply-typed label-selective A-calculus is strongly 
normalizing. 

7 Polymorphic selective A-calculus 

While there exist typing systems that are more powerful than ML's (e.g., second-order 
polymorphic A-calculus), the style of polymorphism used in ML is much simpler. This is 
essentially due to restricting type quantification to appear only at the outset of type expressions, 
which facilitates type instantiation to be done implicitly following applications. The main 
advantage of this type system is that, for A-calculus, any term has a principal (i.e., most 
general) type that can be reconstructed from the shape of the term alone. This obviates explicit 
type declarations: a simple type unification algorithm synthesizes missing types. 

We show here that this form of polymorphism is valid also for label-selective A-calculus. 
This means that, from a typing point of view, the addition of labels is coherent with 
polymorphicaily typed A-calculus. 
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7.1 Syntax and types 

The syntax is that of untyped selective A-calculus with a let construct to introduce 
polymorphism, types being provided by inference. Thus, the syntax of terms is given by: 

M:: = x\ \ t x.M \ M^M' | let x = M inM' 

and the reduction rule corresponding to the new construct is: 

letx = MinN^ [x/M]N. 

As in Damas and Milner's definition [8], types are partitioned into monotypes, ranged 
over by t, and polytypes, ranged over by a. Thus, the language of types is given by: 

w '■'■= u | v (return types) 

r '■'■= {£^t,.. .} (record types) 

t ::= w | r — > w (monotypes) 

a- ::= t | Vv.cr (polytypes) 

where return types u stand for base types and v for type variables. Here again, record types are 
not types of expressions of the term language. 

7.2 Type substitution 

The distinction we introduce here between return types and monotypes is specific to 
selective A-calculus. Indeed, as we shall see, the main difficulty in our system, when compared 
to A-calculus with ML-style polymorphic types, is that function types are always kept flat. 
Observe, indeed, that function types are not return types. For example, {I =>- a} — > {{I' =>- 
P] ~~ * t) is not a valid type expression in our type language. It is possible, however, to obtain 
such an expression as the result of substituting a valid type for a type variable in another valid 
type. For example, doing a direct substitution with {1 =^7} — > 5 for in type {1 =^/3} — > 
wouldresultin{l^({1^7} -» 6)} -> ({1^7} -> S). This means that when we substitute 
a variable that appears as return type with a functional type, we will need to modify the 
structure of the type. 

The solution is to define type substitution with a built-in flattening of the domain type. We 
will denote this operation as [r'\a]r (i.e., substitute type r' for type variable a in r) and it is 
performed as expressed by the following simple rule: 

[(/ - w)\a](r - a) = (([(/ - w)\a]r) • r 1 ) -> «. 

With this rule, our example above results in the valid type {1=^({1=^7} — > ^),2^>7} — > ^ 

This illustrates how our domain of types is radically different from the conventional 
Herbrand universe with the arrow and base type constructors, whose well-known term 
unification is exploited for ML-type inference. We shall thus need to provide our explicit 
unification algorithm. It is a nice property of our system that unique most general unifiers 
exist for our type terms. As we shall see, this is essentially due to the well-foundedness of 
normalization to flattened types which does not change the size of types. 
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7.3 Typing rules 

The typing rules are given in Figure 7. It is interesting to remark that Rules (IV)-(VI) 



r[x I— > a] h x : a 



r[x i-^> 6] \~ M : r — > t 
r h \ t x.M: {1^6} t^t 



r \~ m : {/=s>#} - r — > t r^N-.e 

r h M?N : r 



a not free in T 



r\-M:a 



r h M: Va.o- 



T hM: Va.o- 



fhM: [r\a]o- 



r hM: a r[x o-] h : r 



n-letx = MinA^:r 



(I) 



(II) 



(HI) 



(IV) 



(V) 



(VI) 



Figure 7. Typing rules for polymorphic selective A-calculus 

are in no way specific to selective A-calculus. Since type quantifiers are external, they 
are independent of the structure of monotypes. Thus, these rules are exactly the same 
used in classical A-calculus. Their roles are generalization (IV), instantiation (V), and let- 
introduction (VI). The only, but important, difference between these rules and the classical 
ones is hidden in the use of our flattening type substitution [r\a]a in Rule (V). 

Again, all the desirable properties hold for the polymorphically typed selective A-calculus, 
as expressed by the two following propositions. 

Proposition 7 (subject reduction) If r \- M : t in polymorphically typed selective A- 
calculus, and M — > N, then r \- N : t. 

Theorem 8 (Strong normalization) Polymorphic selective A-calculus is strongly normaliz- 
ing. 

7.4 Type unification 

The key for type synthesis is unification. We give here a unification algorithm for 
the label-selective monotypes defined above. It can be expressed as a simple E-unification 
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problem [11], where the equational theory is that deciding equality of record types. Then, 
our type substitution operation using record-type concatenation constitutes a complete set 
of reduction for this theory. We next give this unification procedure as a complete set of 
equivalence-preserving transformations on a set of type equations. 

A set of type equations cp is said to be in solved form if every equation in it is of the 
form a = r such that the type variable a occurs only once in ip; viz-, as this equation's 
lefthand-side. As usual, such a solved-form defines a variable substitution that can be applied 
to type expressions. 

Figure 8 contains the complete set of transformations for the unification of label-selective 
monotypes. We use the notation a for type variables, w for return types, and r or 6 for any 
type expression. (Again, {} — > u is identified with u.) 

These rules work on a set (a conjunction) of type equations, transforming it into another 
such set. Upon termination, having started from a set <p of equations, the resulting equation 
set is either _L, the inconsistent equation indicating that no solution exists, or sol(yj), a set of 
equations in solved form equivalent to ip. 

In either case, this process can be seen as returning a substitution. In the first case, it is the 
failing substitution _L such that _L (r) =_L for all types r, where _L denotes the inconsistent 
type. In the second case, the solved form sol(yj) is the most general unifier (MGU) of ip (up 
to variable renaming). The rules are written as rewrite rules using a comma as an associative 
and commutative set constructor, and the equal sign as a symmetric equation constructor. That 
is, in these rules the particular order of equations in the set as well as the orientation of an 
equation are irrelevant. As established by the following theorem, they are solution-preserving 
and there is a deterministic strategy that makes them always terminate. 

Theorem 9 (label-selective type unification) There is an algorithm that computes the most 
general unifier of a set of equations on monotypes or reports failure if there is none. 

7.5 Type inference 

It is now easy to derive a type inference algorithm by combining type unification with the 
typing rules of Figure 7. It is sufficient to following the syntactic structure of a given term, 
accumulating new equations in a set, as shown in Figure 9. The function Tp takes a typing 
environment r (a function from term variables to types) and a selective A-term M, and returns 
a pair (<p,t) where (p is a set of type equations in solved form (i.e., a type substitution), and 
r is the principal type of M. The function strip applies to a type expression Vai. • • • Va„.r, 
where n > 0, and returns the expression obtained from r where all the a, s, if any, are replaced 
with fresh names. 6 The expression FV(t) is the set of free variables in r, and by extension 
FV(r) = \J x FV(r(x)). The expression so\(<p), where ip is a set of type equations, is 
the solved form of <p (i.e., the MGU of ip). It is the result of applying the transformation 
rules of Figure 8 to tp until none applies. The expression y(r) is the result of applying 
the substitution <p to the type expression r. By extension, y(-T) is the function defined by 

v {r){x) = v {r{x)). 

This algorithm constructs a derivation tree whose root is r h M : r, where T and M are 
given. Since there is only one way to construct this tree, by induction on the structure of M, 

6 If n = 0 there is no quantifier, and thus strip returns the given type expression as is. 
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(Base type) (Function type) 

r ' u / v r ' u base type 

_|_ u, v base types _|_ r / {} 



(Variable recurrence) 



a = r 



_L 



t ^ a 

a £ Var(r) 



(Variable elimination) 

¥>> a — r a £ Vfcr(yj) - Var(r) 

[r\a]yj, a = T ^ r variable, then r £ Var(<£>) 



(Variable orientation) 



<p, t = a 



a = r 



a variable 
r not variable 



(Redundancy) 

ip, e = e 



(Decomposition) 

if, {1^9} -r^u = {1^9'} ■ r> -> u' 
(p t 9 = 9', r -» w = / -» u' 



(Label completion) 

{£^0} • r -» w = r 1 -» w' '•'/{} 



Figure 8. Equation-rewriting rules for type unification 
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Tp(r,x) = (0,strip(r(x))) 



Tp(r,\tx.M) = (<p,[r\P]{{l^a}^p)) 

' (<p,T) = Tp(r[x^ a],M) 



where 



a, (3 fresh 



Tp(r,MiM') = (sol(<?U ip'U {t = {I^t 1 } -» a}), a) 

' (ip,T) = T P (r,M) 
where ^ (<^',r') = Tp(r,M') 
a fresh 



7>(r, letx = M in M') 



(soI(^iV),t') 

' (<p,T) = T P (r,M) 

where < (^', T ') = Tp(r',M') 

r' = r[x ^ v(fv(t) - Fv(y>(r))).T] 



Figure 9. Type inference algorithm 
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this algorithm is complete and correct. This is because only necessary equations are added, 
except for generalization and instantiation, which are handled in the most general way in the 
variable and let cases. 

8 Conclusion and further work 

We have proposed two typing systems for label-selective A-calculus: simple types and 
ML-style polymorphic types. The latter are smoothly accommodated thanks to the existence 
of a simple but flexible record-type concatenation operation that facilitates building label- 
selective currying right into type substitution and unification. Integrated into a polymorphic 
functional programming language with currying, this provides a powerful tool, extending 
currying facilities and helping to memorize multi-argument functions. 

An interesting subject is how to mix record operations and selective A-calculus. The idea 
comes from the natural encoding of records in the untyped calculus, as: 

{ii^a u ... ,l n ^a n } — > Xseis^s^ai ... f n a n ) 

where sel is a distinguished fixed channel and s is a function selecting a label and discarding 

the others individually (we have no way to discard them at once), like A^xi \i n x n .Xk. 

We can even have functions using more than one label. This is in fact the basic idea for a 
transformation calculus. However, there are some essential differences between a classical 
definition of records and this encoding as it accommodates numerical indices. We suspect that 
type inference of such a calculus with useful operations might turn out to be rather complex. 

Another application of this calculus might be found in parallel processing. If we now 
see labels on a stream as identifying threads, the commutation capability directly interprets a 
concurrent evaluation. This is an idea very close to the dataflow paradigm, but we hope to 
replace flow analysis by type synthesis. Another, but not contradictory, view is to see labels 
as names, like for process communication. It shows a link, which can easily be made more 
evident, with calculi like Milner's 7r-calculus [16]. The conjunction of those two views seems 
an interesting prospective. 

The last, but more immediate, concern is compilation. Two different versions of selective 
A-calculus using de Bruijn indices, through explicit substitutions [1], have been developed. 
They reflect two different levels of compilation: one that is faithful to label names, and one 
where they can be replaced by numeric stack offsets. This might be the basis for an efficient 
compilation method, which should be built on a completely curried vision. That is, there 
should be no overhead caused by currying. The efficient compilation method given by Ohori 
for a record calculus [18] gives us some evidence that this is possible. 
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Appendix: Proofs of theorems 



Proposition 4 Record-type concatenation is associative. 



Proof: Let us show that <fr r o <fr s = 4>r^4> r (s)- We proceed with inverses: 



-l 



i ~ o r (pi) - o s (<f>-J(i)) 



We then have: 



r-(s-t) = r\&4> r (sW<t> s (t)) 



= riS4> r (s)iS4> rltlMs) (t) 
= {r-s)-t. 



I 



Proposition 5 (subject reduction) IfT h M : r and M -> N then r \- N : t. 

Proof: We only need to prove this property when M is a redex and N is the result of this reduction. 
We can then generalize by substitution and repetition. 

If M is a /3-redex, it is of the form (\tx: 6.P) 1Q. Then, the basis of the proof tree is: 



After reduction the result is N = [x/ Q]P. We obtain a derivation tree for r h [x/Q]P from those 
of r[x \— > 6] h P : r — > t and f h Q : 6 as follows: (1) doing all a-conversions necessary to the 
substitution of x by Q; (2) suppressing x in the environments (except where it is redefined by an 
abstraction); (3) where x appears without being defined in the environment, replacing J" h x : 6 by 
the derivation tree of T' \- Q : 6. This poses no problem since VyG FV(Q) r(y) = r'(y). 

If the reduction is a reordering, we have seven cases. We will only work out in detail cases (3), (6) 
and (7). 

Case (3): If the reduction is (3), then the derivation tree must have the following form: 



r[x^6] HP: r^r 



r\- \ t x:6.P: {l=>6}-r^T 



T\x \ — > 6] \~ M : {qn^O 1 } -r^r 



r\-N-.e' 



r h X pm x:6.M: {pm=>6,qn=>6'} ■ r — > r 



r h {\ pm x: 6.M) ~N : {pm=>6} ■ r -» r 



Since x 0 FV(N), we can obtain the following derivation tree after reordering: 
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r[xi->0]\-M:{qn=>0'}-r->T T[x \-> 0]\- N : 0' 

r h \ pm x:6.{M~N) : {pm=>6}-r^T 
Case (6): If the reduction is (6), then n < m, and the derivation tree must have the following form: 
T[x ^ 6] h M : {pn^O 1 } ■ r -> r 

r\- N-.e' 



r h \ pm x:6.M : {pn^6',pm^6} ■ r — > r 

T h (A^x: 0.M) f„iV : {p(m - 1) ^6} ■ r — r 

Since x 0 FV(iV), we can obtain the following derivation tree after reordering: 

r[x^0] hi: {pn^O'}- r^r r[x ^ 0] \- N : 0' 
r[x^6] VM-N-.r^T 



r h \ p{m _ 1)X : 6. (Mf„ AT) : {p{m - 1 ) => 6} ■ r — r 
Case (7): If the reduction is (7), then m < n, and the derivation tree must have the following form: 
r[xh^0] hi: {p{n- \)^6'}-r^T 

r\- N:6' 



r h \ pm x:0.M : {pm^0,pn^0'} ■ r —> r 

rh{\ pm x:0.M)~N:{pm^0}-r^T 

Since x 0 FV(N), we can obtain the following derivation tree after reordering: 

r[x^0] hi: {/?(«- l)=^0'}-r-»T r[xh^6>] \- N : 6' 
r\x^0]hM p ^ ) N:r^ T 



r h \ pm x:e.{Mf n N) : {^m^0} • r — r 



Theorem 6 77ie simply typed selective X-calculus is strongly normalizing. 

Proof: The idea is to construct a function that gives the longest reduction of a term in function of its 
input. By reduction steps, we only mean here /3-reductions, since we already know that reordering 
is Noetherian. 

First, let us define zero functions, and the operation of rectification of a function. In fact, we use 
selective functions in place of classical functions, labeling arguments. They are only a practical 
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notation since we know that selection is deterministic by the confluence theorem, and we could 
translate them to classical functions using their types and the order on labels. 

Let t = {l\ =>■ t\, ...,£„ =>■ t„} — > u be a simple type. The zero-function for r, noted 0 r , is 
the function x\ : t£ , . . . , l n x n : t„*).0, of type t* , where * is defined by induction as 

=^n, ...}-»«)* = {l\ =^Tj*, . . .} — > i'nf (we replace every base type with int). 

To rectify a function/ of type r = {£1 ri , . . . , l n r m } • r — > m to r — > « one simply applies it to 
the corresponding zero-functions: rect(r — > m,/ : r) = f{l\ =>0 ri , ...,£„ =>0 r "). 

We define our function Tp (M) by induction on the structure of the term M, annotated with types in 
some typing environment T. We suppose that keywords S and variables V are independent, and 
use V U S as symbols for the respective selective functions. 

For a variable r h x : r, the associated function is X x x: t*.x. 

For an abstraction r h A^x: 0.M : • r — > r, the associated function is A^x: 0* .rr[ Ah _e](M). 

For an application r h : r — > r, with r \- N : 6,the associated function is: 

T r (M?N) = \ XlXl . ■ ■ ■ .\ Xn x n .((T r (M)~ Xl . . . ? k x k ?N a ) + rect(int,N a : 6*) + 1 ) 

where: 

1. FV(N) ={xi,... ,x„}; 

2. FV(M) n FV(N) = {x u . . .,x k }, 0 < k < n; 

3. N a = T r {N)T l x 1 ...T n x„; 

4. for/ : {l\ =>■ 6\ ,...,£„ =>■ 0 n } — > int and a : int, 

f+a = \(l 1 ^x 1 :0 1 ,...,l„^x„:0„).(f(l 1 ^x 1 ,...,l„^x„) + a). 

This sum of three terms expresses that N may be reduced after substitution in M, or before, and that 
there may be one step of /3-reduction. 

In this function we make two approximations. The first one is that we count one step for each 
application, whether or not there is an abstraction to reduce. The second one is that we take the 
sum of the call-by-name and call-by-value strategies, and not their maximum. Since these are only 
over-estimations, our function gives an upper-bound of the longest reduction path. 

If r h M : {l\ 6\ , . . . , l n 0 n } — > r and r \ F v(m) = {*i ^ n , . . . , x m h-> t„,}, then T r {M) is a 
total function from 6\ x • • • x 0* x Tj* x • • • x r^j to int. This means that on any complete input that 
is coherent with its typing, M will terminate. Moreover, an upper bound of its longest reduction 
path is given by rect(int, Tr{M) : (r — > r)*). | 

Proposition 7 (subject reduction) If T V M : t in polymorphically typed selective A- 
calculus, and M — > N then T h N : r. 

Proof: Since polymorphism can only be used in conjunction with let, the proof for simple types is 
enough except for let-reduction. 

In this last case, the derivation tree starts with: 

r\-M:a r[x t-^> a] \- N : t 
r h let x = M in : r 
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We first perform all a-conversions necessary to the substitution of x by M. After reduction, we 
obtain a tree with root r h [x/M]N : r from the derivation tree of T[x i— > a] h N : r by replacing 
every occurrence of the axiom r'[x t— > a] \- x : aby the derivation tree of J" h M : <r; observing 
thatVye FV(M) r(y) = r'(y). | 

Theorem 8 Polymorphic selective X-calculus is strongly normalizing. 

Proof: We find an upper bound of the longest evaluation of M by that of M, which is M where 
all occurrences of let are suppressed by transforming letx = P'mN into K^([x/P]N)^P, where 
K = Aix.Aiy.x. We need K for the case where x does not appear in N. Since the result is 
monomorphic everywhere, the argument for the simply typed calculus holds. | 

Theorem 9 (label-selective type unification) There is an algorithm that computes the most 
general unifier of a set of equations on monotypes or reports failure if there is none. 

Proof: We first prove the correctness of the rewriting system of Figure 8. That is, for each rewrite 
rule, any solution of the denominator is a solution of the numerator, and conversely, any solution 
of the numerator can be extended into a solution of the denominator, possibly by introducing new 
variables missing in the numerator. 

The rules labeled Base type, Variable recurrence, Function type detect inconsistencies in the 
equations. That is, respectively, equation between two different base types, between a type variable 
and a type containing it, or between a base type and a functional type. When one of these rules 
applies, the system has no unifier. 

The Variable eliminationmle substitutes variables (using flattening type substitution), while keeping 
their referents. Let a be a solution of the numerator. Then, by construction, cr(a) = cr(r), thus it 
is also a solution of the denominator; and conversely. 

The Variable orientationrule simply reorients an equation. It is not really necessary and is provided 
only to obtain the solved form with all solved variables on the left. Clearly, it leaves unchanged the 
set of unifiers. So does the Redundancy rule which just suppresses tautological equations. 

Decomposition takes a label already present on the two sides of an equation, and equates the types. 
Correctness is clear. 

Whenever a label appears only on one side of an equation, it is necessary to introduce it in the other 
side. This is done by the Label completion rule using record-type matching. Any unifier of the 
denominator is also a solution of the numerator, since i±) r 1 — > a = r 1 ■ cj)~, 1 {i^0} — > a, 

which is by unification equal to r' — > ui' . Conversely, if a is a unifier of the numerator, then it 
maps u' to a functional type of the form <f>~, 1 {£^a(6)} ■ r" — > a/', which can be extended for the 
denominator by adding cr(a) = r" — > ui" . 

We next prove that there is a terminating strategy. Termination follows for the well-foundedness of 
a decreasing measure. A variable is solved when it appears only once, and as the lefthand-side of 
an equation. We exhibit a strategy that reduces the lexicographical measure (number of unsolved 
variables, sum of sizes), where the size of a type is the total number of labels, variable occurrences 
and base types it contains. 

The three failing rules terminate. Redundancy, and Decomposition reduce the sum of sizes. 
Variable elimination and Variable orientation reduce the number of unsolved variables. 

Label completion by itself does not reduce the measure. But if it is always used it in combination 
with Decomposition on the same equation, eliminating or failing as soon as possible, this always 
reduces the number of unsolved variables. If ui' is not a variable, we fail immediately. Otherwise, 
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it is solved, but we create a new variable a. We repeat this until we can solve a "successor" of a 
with the left hand side (which may suppose creating a lineage to u> too, if completion is mutual). 
This sequence terminates, since there is only a finite number of labels on each side. 

Last, we must show that our result is in solved form. First, in every equation, at least one side is 
a solved variable. If the two sides are functional types, then either Decomposition or Completion 
applies. If one side is a base type, then the other side is a solved variable, otherwise Elimination, 
Redundancy or some failure applies. If the two sides are variables, then at least one is solved. 

We construct the substitution a by taking for each equation a = r, a solved, cr(a) = r. a is a 
most general unifier of the final system, and, as a consequence, if we suppress definitions for all 
variables introduced by completion, a' is a most general unifier of the original system. | 
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